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Abstract. Erling F0lner proved that the amenability or nonamenabil- 
ity of a countable group G depends on the complexity of its finite subsets 
S C G. Complexity has three measures: maximum F0lner ratio, optimal 
cooling function, and minimum cooling norm. 

Our first aim is to show that, for a fixed finite subset S C G, these 
three measures are tightly bound to one another. We then explore their 
algorithmic calculation. 

Our intent is to provide a theoretical background for algorithmically 
exploring the amenability and nonamenability of discrete groups. 



1. Introduction 

A countable group G is amenable if it admits a finitely additive, translation 
invariant measure denned on every subset of G such that the measure of G 
is 1. Such groups are important in measure theory, dynamical systems, and 
ergodic theory. Nonamenable groups lead to the Hausdorff-Banach-Tarski 
paradox which asserts that one copy of the unit ball in R 3 can be rigidly 
torn into finitely many pieces that can be rigidly reassembled to form two 
copies of the unit ball. 

In [Fol54] and [Fol55], Erling F0lner gave two geometric characterizations of 
countable amenable groups G. These characterizations make these groups 
very attractive to geometric group theorists. Since we emphasize finite sub- 
sets S of G, we shall state the F0lner theorems in a slightly nonstandard way. 
In particular, F0lner's statement of Theorem 1.2 is in terms of global cooling 
functions and not local cooling functions. First we give some introductory 
definitions. 



Let r = r(67, C) denote the Cayley graph of an infinite group G with finite 
generating set C, let S C G be nonempty and finite, let E(S) denote the set 

Date: January 4, 2012. 

1991 Mathematics Subject Classification. 43A07, 20F65. 
Key words and phrases, amenable, finitely generated group. 

1 



2 



J. W. CANNON, W. J. FLOYD, AND W. R. PARRY 



of edges of T with at least one vertex in S, and let dE(S) denote the set of 
boundary edges of S consisting of those edges that have exactly one vertex 
in S. Orient each edge e of E(S) with one of its two possible orientations 
so that e has an initial vertex i(e) and a terminal vertex t(e). For the edges 
e of dE(S), choose the orientation so that i{e) 6 S. 

The F0lner ratio of S is the quotient \S\/\dE(S)\. (We will see that dE(S) + 
0.) We will never reduce this fraction except to compare sizes. That is, if we 
write that the F0lner ratio of S is a/b, then we always assume that a = \S\ 
and that b= \dE(S) | . 

A cooling function for S is a function c : E(S) — > R such that, Vs £ 5, 

h(s) = Yl < e ) - E c ( e ) ^ L 

i(e)=s t(e)=s 

One can interpret c(e) as the heat pumped along e from i(e) to t(e). Then 
h(s) is the net loss of heat at s. The cooling norm of c is |c| = max eg g( S ) l c ( e )l- 

Here are the two F0lner characterizations. For the purposes of this paper, 
the reader can take either of these F0lner theorems as defining amenability. 

Theorem 1.1 (F0lner set). The finitely generated group G is amenable if 
and only if there exist finite subsets Si C S2 C S3 C • • • exhausting G whose 
F0lner ratios \Si\/\dE{Si)\ approach 00. 

Theorem 1.2. The finitely generated group G is nonamenable if and only 
if there exist finite subsets S\ C S2 C S3 C • • • exhausting G and admitting 
cooling functions Ci : E(Si) — > R whose cooling norms |q| are uniformly 
bounded. 



Though no direct use of measure theory appears in this paper, the reader 
may appreciate an orientation to F0lner's procedures. F0lner's proofs pro- 
ceed roughly as follows. 

F0lner first proves his cooling theorem. 

If uniformly bounded cooling functions do not exist, then F0lner generalizes 
an argument of Banach which describes an outer measure on the group and 
applies a version of the Hahn-Banach theorem to create the desired measure 
on the group. It is historically interesting to note that Banach's purpose 
in proving the Hahn-Banach theorem was to prove that Abelian groups are 
amenable. 
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If uniformly bounded cooling functions do exist, then the Banach argument 
fails; and it is fairly easy to show that a measure as in the definition of 
amenable group cannot possibly exist. 

The F0lner set theorem then has an easy half and a hard half. 

The easy half assumes the existence of subsets whose F0lner ratios approach 
oo. Since these sets have relatively small boundaries, it is impossible to pump 
a lot of heat out of these sets without using cooling functions of large norm. 
Hence cooling functions of uniformly bounded norm do not exist and the 
group is amenable. 

The hard half assumes that such sets do not exist and, by an extremely 
indirect proof, F0lner proves that cooling functions with uniformly bounded 
norms do exist. 

It is the hard half of this proof that has most interest for us. Our goal is to 
make F0lner's indirect proof as direct and as algorithmic as possible. In the 
process, we streamline the F0lner proof. 

Statement 2 of Lemma 12.11 shows that cooling functions for S always exist 
in this context. Using this, it is easy to show that the two theorems are 
equivalent by application of our main theorem: 

Theorem 1.3. Let T be a locally finite graph with vertex set G and a 
nonempty finite subset S C G which admits a cooling function. Then S 
admits a cooling function c of minimum possible cooling norm N = \c\, and 

N = |c| = max\S \/\dE(S )\. 

SoCS 

That is, the minimum cooling norm is equal to the maximum F0lner ratio 
of subsets So of S, taking the F0lner ratio of the empty set to be 0. We 
call such a cooling function with minimum cooling norm an optimal cooling 
function. This absolute cooling theorem will be a corollary to the cooling 
theorem of Section [2j 

We prove the cooling theorem in Section [2] by refining F0lner's original ar- 
guments. In Section [3] we interpret this cooling theorem in terms of linear 
programming and give another proof using standard results of linear pro- 
gramming. Section [3] concludes with a brief derivation of statement 2 of the 
cooling theorem using the max flow min cut theorem. The linear program- 
ming approach in Section [3] leads to an algorithm, presented in Section HJ 
which is a modification of the standard simplex algorithm. It finds an opti- 
mal cooling function c and a F0lner-optimal subset So C S, that is, a subset 
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So C S whose F0lner ratio is the maximum possible. Moreover, the subset 
So which the algorithm finds has the property that if S' is a F0lner-optimal 
subset of S, then S' C S. In other words, there is a maximal such subset of 
S with maximum F0lner ratio, and the algorithm finds it. 

After Section HI we describe our partial results toward another algorithmic 
process for finding the minimum possible cooling norm N, finding an optimal 
cooling function, and finding a F0mer-optimal subset So C S. The main 
ingredient is the relative cooling theorem, another corollary to the cooling 
theorem of Section [2] (see Section [5]). Then we describe the process of peeling 
layers away from an arbitrary set to find a subset of maximum possible 
F0mer ratio (Section [6]). And finally we show how to build up optimal 
cooling functions layer by layer from known cooling functions on subsets 
(Section [7]). 

It can be challenging to determine whether or not a countable group is 
amenable. In particular, despite concerted efforts for over 30 years it is still 
not known whether or not Thompson's group F is amenable. And while 
Bartholdi and Virag showed in [BV05] that the Basilica group is amenable, 
their proof is via random walks and one still doesn't know how to construct 
a F0lner sequence for it. Our hope is that the results of this paper will serve 
in exploring amenability for countable groups. 

Permanent setting for the remainder of the paper: T is a locally 
finite graph; G is the set of vertices of T; S is a nonempty finite subset 
of G; E(S) is the set of edges of T having at least one vertex in S, and 
dE(S) C E(S) is the set of edges of V having exactly one vertex in S. 

Always we assume that the set S admits a cooling function. 

Edge orientation: For each edge e G E(S), we choose one of the two 
possible orientations of e, so that e has an initial vertex i(e) and a terminal 
vertex t(e). If e G dE(S), then we choose that orientation which has i{e) G S 
and t(e) S. 

We always assume given, as initial data, a function ho ■ S — > (0,oo). 
In most applications, this function will be identically equal to 1, but in the 
relative cooling theorem it is important that we be allowed to modify this 
initial data. 
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2. The cooling theorem 

Cooling functions: A cooling function rel ho is a function c : E(S) — > R 
such that 

VsGS, h(s)= c ( e )- ^ c(e)>/i (s). 

i(e)=s t(e)=s 

We may interpret c(e) as the heat pumped along edge e from initial vertex 
i(e) to terminal vertex t(e). The function is then the net heat loss at 
the vertex s. The absolute cooling theorem of the introduction deals with 
the special case where ho is constant and equal to 1. The value ho(s) gives 
a lower bound on the amount of heat to be pumped out of vertex s. The 
sum H(S) = Yl s eS^o{s) gives the minimal amount of heat that a cooling 
function must pump out of S through the boundary edges dE(S) of S. 

Recall that we always assume that S admits a cooling function. The next 
lemma provides an equivalent condition. We need two definitions for this. 
Let T be a finite subset of G. A connected component of T consists of all the 
vertices in a connected component of the graph U{e : e G E(T) \ dE(T)}. 
To say that a connected component To of T has a nonempty boundary, we 
mean that dE(T ) ^ 0. 

Lemma 2.1. (1)A nonempty finite subset T of G admits a cooling func- 
tion rel ho if and only if every connected component of T has a nonempty 
boundary. 

(2) IfT is infinite and connected, then every nonempty finite subset T C G 
admits a cooling function rel ho ■ 

Proof. Let T be a nonempty finite subset of G. 

We first prove the forward implication of statement 1. Suppose that T 
admits a cooling function rel ho- Let To be a connected component of T. 
Then H(T ) > 0. Edges of E(T )\ dE(T ) conduct no heat from T . Thus 
dE(To) 7^ 0. This proves the forward implication of statement 1. 

For the backward implication, suppose that every connected component of 
T has a nonempty boundary. Then for each x £ T there exists an edge path 
ei = (x ,xi), e 2 = (xi,x 2 ), e n = (x n -i,x n ) with x = x , with Xi G T 
for i < n, and with e n G dE(T), so that x n ^ T. Given x, transport ho(x) 
units of heat from x = xo to xi, the same from x\ to etc., until ho{x) 
units have been, in effect, transported from xq to x n with no net gain at 
any intermediate vertex. That is, viewing ei as oriented from to Xj, we 
define a function from -E(T) to R so that its value at ei, . . . , e n is ho(x) and 
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otherwise. Call this function a heat-flow path. Adding one heat-flow path 
for each x E T results in a function c : E(T) — > R satisfying h(x) = ho(x) 
for each x E T. Thus c is a cooling function rel ho- 

This proves statement 1 of Lemma 12. 1L 

To prove statement 2, suppose that T is infinite and connected. Let x E T. 
Because T is infinite and T is finite, there exists y E G \ T. Because T is 
connected, there exists an edge path in T joining x and y. Some edge in this 
edge path is in dE(T). It follows that every connected component of the 
boundary of T is nonempty. Statement 2 now follows from statement 1. 

This proves Lemma l2.1i □ 

F0lner ratio: Because we always assume that S admits a cooling function 
rel ho, Lemma 12.11 implies that dE(S) ^ 0. The F0lner ratio FR(S) = 
H(S)/\dE(S)\ measures the average amount of heat that must be pumped 
out of S along each edge e E dE(S) by any cooling function rel ho- Since 
S admits a cooling function rel ho, so does every nonempty subset So of 5*. 
So FR(S ) exists for each S , We set FR(0) = 0. 

Lemma 2.2. If So C S, then FR(So) is a lower bound on the norm of every 
cooling function on S rel ho - 

Proof. Since a cooling function must cool each subset So C S, since the 
heat in So must be carried out of So along the boundary edges of So, and 
since at least one boundary edge must carry at least the average required 
per boundary edge, it follows immediately that the minimum cooling norm 
N = \c\ of a cooling function must be at least as large as the F0lner ratio 
FR(So). □ 

Our absolute cooling theorem will show that iV is exactly equal to the max- 
imum of the F0lner ratios which occur in this lemma. 

The simplex A(5) with vertices S and the functions /: Let A(5) 
denote the abstract simplex whose vertices are the elements of the set S. An 
element of A(5) is therefore a function / : S — > [0, 1] such that Yls^s f( s ) = 
1. With each element / E A(S) we associate a function / : V — >■ [0, oo), 
where 

V = U eeE(s) de, 
Vv E S, f(v) = f(v)/h (v), and 
Vv£V\S, f(v) = 0. 
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Theorem 2.3 (Cooling). (1) If N is the minimum possible norm of a cooling 
function for S rel ho, then the reciprocal 1/N is given by the minimum value 
of a convex function A(f) on the simplex A(S) as follows: 

1/N = min A(f), 

/6A(S) 

where f is the modification of f defined above and 

eeE(S) 

(2) The minimum possible norm N is given by the maximum F0lner ratio: 

N = maxFR(So). 

SqCS 

(3) If FR{Sq) realizes the maximum in (2), then the function f G A(S') 
whose modification f is constant on Sq and onV\So realizes the minimum 
in (1). 



Proof. (1): The verification that A is a convex function is left to the reader. 
Since the function A : A(S) — > (0, oo) is positive and continuous on the 
compact set A(5), there is certainly a function in A(S) which gives a positive 
minimum for A. 

For the rest, we follow F0lner's argument. We fix a candidate norm N for 
a cooling function rel ho. We introduce a variable y e for each e G E(S) and 
note that a cooling function of norm < N is an assignment of a real value 
to each of the variables y e in such a way that the following inequalities are 
satisfied: 

Vs G S, h(s) = E Ve - E Ve - M 5 ); and 

i(e)=s t(e)=s 

Ve G E(S), y e > -N and - y e > -N. 

The first of these inequalities says that each element of S is cooled by the 
required amount. The last two inequalities say that the norm of the resulting 
cooling function is < N. 

We assume that N is too small, so that no cooling function exists, and con- 
clude from Farkas' lemma, Lemma f2.9t that there exist nonnegative numbers 
a(s), /3(e), and 7(e) satisfying the following two conditions: 



(2.4) 



Y^a{s)h{s)+ £ (/3(e) - 7 (e))y e = 0, 

sGS eG-E(S) 
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and 

(2.5) Y,a(s)ho(8) - N- Y, (/3(e) + 7(e)) >0. 

seS e£E(S) 

We cannot have a(s) identically 0, for then the left-hand side of line 12,51 
would be < 0, a contradiction. We may therefore scale the coefficients a(s), 
/3(e), and 7(e) so that ^a(s)/io(s) = 1. That is, we may assume that 
f = a ■ ho G A(S'). We may assume the function a is extended to the 
vertices of dE(S) not in S so as to be on vertices not in S. It follows that 
this extended a is our standard modification / of the function /. 

We normalize /3(e) and 7(e) as follows: We replace the larger of /3(e) and 
7(e) by the nonnegative difference |/3(e) — 7(e) | and the other by 0. This 
has no effect at all on the condition in line 12.44 an d it can only increase the 
sum in line 12.51 so the two conditions are still satisfied. 

We claim that /3(e) + 7(e) = \a(i(e)) - a(t(e))\ = \f(i(e))-f(t(e))\. Indeed, 
the coefficient of y e in line 12.41 is 0, and so 

(2.6) a(i(e)) - a(t(e)) + /3(e) - 7(e) = 0. 

Because of our normalization, one of /3(e) and 7(e) is 0, and so | /5(e) + 7(e)) = 
I /5(e) — 7(e)| . Hence 

/3(e) + 7(e) = |/3(e) + 7 (e)| = |/3(e) - 7 (e)| = \a(i(e)) - a(i(e))|, 

as desired. 

With £a(a)fco(s) = 1 and /3(e) + 7(e) = \a{i(e)) - a(t(e))| = |/(i(e)) - 
/(i(e))|, we find that line 12.51 is equivalent to 

A(f)= £ |/(<(e))-/(t(e))|<l/JV. 

Thus cooling functions of norm N exist if l/N < minjg^s') A(f)- 

The converse of this statement can be proved by assuming that there exists 
/ G A()S) with A(f) < l/N and reversing this argument. The function / 
determines the numbers a(s) as above. We choose the numbers /3(e) and 
7(e) as above so that one of them is and so that line 12.61 is satisfied. We 
conclude that lines 12.4 1 and 12.51 hold. Now Farkas' lemma implies that there 
is no cooling function with norm N. So cooling functions with norm TV exist 
if and only if l/N < minj e ^(5) ^(/)- 

This proves (1). 

(2) and (3): Lemma 12.21 shows that the minimum possible cooling norm N 
for S must be at least as large as max£ oC s FR(Sq). Our remaining task is 
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to find a subset So of maximum F0lner ratio and to show that the function 
g G A(S) associated with So as in (3) does in fact realize the minimum 
A(g) = min /6A(s) A{f) in (1). 

Suppose that A assumes its minimum at / G A(S). We let So C S denote the 
set of points in S at which the corresponding function / takes its (positive) 
maximum. We shall see that this set So satisfies the required conditions. 



As in (3), we let g be constant and positive, equal to 5, on So, and constant, 
equal to 0, on the complement of So- The corresponding element g G A (S) 
therefore satisfies the equation 

l = J>(a)= ^25-ho(s) = 5-H(So). 

sgs ses 

Thus 5 = l/H(So)- It is easy to calculate A{g) since \g{i(ej) — g(t(e))\ is 
only nonzero for e G dE(So). Therefore, 

(2.7) A(g)= £ \me))-me))\=S^dE(S )\= l -^lf = ¥ ^- 

We know that A{f) realizes the minimum in (1). We complete the proof by 
showing that A(f) > A(g) so that the latter also realizes the minimum in 

(!)• 

We apply the methods of the calculus of variations. We consider the func- 
tions f\ G A (5) defined as follows: 

fx = — — for A very small in [0, 1). 
1 — A 

Note that we are simply reducing some of the positive values of / by a little 
bit and are then scaling so as to remain in A(S). The following lemma 
completes the proof. 



Lemma 2.8. 

d(A(fx)) 



< 



= A(f)-A(g). 

A=0 



Proof. The derivative is certainly > since A(f) minimizes A. 

It is an easy matter to take the derivative of \x\ if we know which of the 
two options |x| = 1 ■ x or \x\ = — 1 • x is true. Luckily, we can determine 
appropriate signs ±1 for each of our three absolute values \ f(i(e)) — f(t(e))\, 
\f\(i(e)) — fx(t(e))\, and \g(i(e)) — g(t(e))\, which appear in the defining 
formula for A(f), A(f(X)), and A(g), and those signs may be chosen com- 
patibly. 
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To each edge e S E(S) we assign a number e(e) = ±1 as follows: if \f(i(e)) — 
/(*(e))| = /(i(e)) - /(t(e)), then e(e) = 1; otherwise, e(e) = -1. 

For A sufficiently small, the function f\ will clearly require the same signs 
e(e). 

Because the function g is positive only on the elements of So, where / takes 
on its maximum value, g will require the same sign provided that exactly 
one end point of e is in So- If neither end point of e is in So or if both end 
points of e are in So, then g{i{e)) — g(t(ej) = and we may use the same 
value of e(e) for g as has already been chosen for / and f\. 

Thus we may choose multipliers e(e) = ±1 so that we have the following 
three equalities for all A sufficiently small: 

|/(i(e)) - f(t(e))\ = 6(e) • (f(i(e)) - f(t(e))) 
|/ A (i(e)) - fx(t(e))\ = e(e) • (/ A (i(e)) - / A (t(e))) 

|5(<(e))-j(t(e))|= e (e)-(sl(i( e ))-5(t(e))). 
We would be unable to obtain compatible signs e(e) if So had been chosen 
via anything but the maximum value of /. With all absolute value signs 
replaced by constants e(e), it is an easy matter to calculate d(A(f\))/d\ by 
the quotient rule, and the result is as stated in the lemma: 



d(A(f x ))/d\ 



= E.<e)(/(i( e ))-/(t(e)))i fVI=x 

A=0 



A(f)-A(g). 



A=0 



A=0 



□ 



This argument completes the proof of the cooling theorem. □ 
Proof of the absolute cooling theorem. Set ho = 1 • Q 



We conclude this section with a discussion of Farkas' lemma. In Section 4 of 
[Fol55| . F0mer states essentially the following result, which he presents as a 
minor modification of a result proved by Carver in Theorem 3 of |Car22j . 

Lemma 2.9 (Farkas' lemma). Let L : M. n — > R m be a linear map and let 
b G M m . Then a necessary and sufficient condition that there be no solution 
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to the inequality L{x) > b is that there exist a nonnegative vector a G 
a > 0, such that L{x) ■ a = but b ■ a > 0. 



This is one of the equivalent formulations of Farkas' lemma. To see this, we 
apply Corollary 7.1e on page 89 of Schrijver's book [Sch99] . This corollary 
is one form of Farkas' lemma. It states the following, a bit loosely. Let 
A be a matrix and let b be a vector. Then the system Ax < b of linear 
inequalities has a solution x, if and only if yb > for each row vector y > 
with yA = 0. Replacing b with —b, yields the following. The system Ax > b 
of linear inequalities has a solution x, if and only if yb < for each row 
vector y > with yA = 0. Lemma 12.91 is the contrapositive of this. 



3. Linear programming 

In this section we give another proof of the cooling theorem, Theorem 12. 3( 
using standard results from linear programming. Murty |Mur76| and Schri- 
jver [Sch99] are good references for linear programming. We maintain the 
notation of Section [21 

We wish to minimize the maximum value of 

c(e),-c(e) VeeE(S) 

subject to the conditions 

i(e)=s t(e)=s 

In the spirit of linear programming, we introduce a variable x e for every 
e 6 E(S) and restate this problem as follows. Minimize the maximum value 
of 

x e , —x e Ve S E(S) 

subject to the conditions 

x e — x e > ^o( s ) Vs € S. 

i(e)=s t(e)=s 

Now we introduce one more variable Xh and in effect replace x e by x e divided 
by the maximum value of {\x e \ : e £ E(S)} to see that our problem is 
equivalent to the following problem. Maximize Xh subject to the conditions 

ho(s)xf l — x e + x e < Vs G S 

(o_±\ i(e)=s t(e)=s 

o- ^ 1 1 

Ve E E(S). 



X e < 1 
-X P < 1 
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The solution of this problem is the inverse of the solution of our original 
problem. 

This is a linear programming problem in the form maxjcx : Ax < b} as in 
line 13 on page 90 of Schrijver's book [Sch99]. Here x is the column vector 
of variables Xh, x e for e G E(S). We have that c is the row vector with as 
many components as x with the component corresponding to Xh being 1 and 
all other components being 0. Similarly, b is the column vector of right side 
constants in line 13.11 and A is the coefficient matrix of this system of linear 
inequalities. (The A here is not to be confused with our previous A.) 

Now we apply the duality theorem of linear programming. Just as Farkas' 
lemma, which we encountered at the end of Section [2j has a number of 
formulations, so does the duality theorem. We apply the formulation given 
in Corollary 7.1g on page 90 of Schrijver's book [Sch99| : 

maxjcx : Ax < b} = min{yb : y > 0, yA = c}, 

provided that both of these sets are nonempty. The origin is in the first 
set, so the first set is nonempty. We will soon see that the second one is 
nonempty too. 

So we now have new variables y s for every s £ S, one for every inequality in 
line 13.11 corresponding to an element of S. We also have new variables y e and 
z e for every e G E(S) corresponding to the two inequalities in line 13.11 for 
every e G E(S). Furthermore y is the row vector whose entries are these new 
variables. The duality theorem transforms our problem into the following 
one. Minimize 



-Vi(e) + Vt(e) +y e ~ z e = Ve G E(S) 
y s >0 Vs G S y e >0 Ve G E(S) z e >0 Ve G E(S). 

The last equation requires the convention that y t ^ = if t(e) ^ S. 

Suppose that we fix the variables y s and let the variables y e and z e vary. 
Then y e — z e is fixed for every e G E(S). If y and z are nonnegative real 
numbers with y — z = a, a fixed value, then the smallest possible value for 
y + z is \a\. Thus our problem is equivalent to the following one. Minimize 



e€E{S) 



subject to the conditions 





e&E(S) 
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subject to the conditions 

^2h (s)y s = 1 
y s >0 Vs 6 S. 

Now it is clear that our second set is also nonempty, and we have statement 
1 of the cooling theorem. 

To prove statements 2 and 3 of the cooling theorem, note that the map A 
is not only convex but piecewise linear. In fact, its restriction to every cell 
of the first barycentric subdivision of A (S) is linear. As for general linear 
programming problems, it follows that its minimum occurs at a vertex of 
the first barycentric subdivision of A(S). (See Section 3.5.5 of Murty's book 
[Mur76] or Section 8.3 of Schrijver's book [Sch99].) These vertices are the 
points / G A(5) such that / is constant on some subset So of S and on 
S \ Sq. From the definition of A, we see that if / is such a point of A (5), 
then 

A(f) _ \dE(S )\ 1 
U> \S \ FR(S y 

as in line 12.71 

This completes our linear programming proof of the cooling theorem. 

We conclude this section with a brief discussion involving the max flow 
min cut theorem. Given the close connection between the max flow min 
cut theorem and linear programming, (See Section 7.10 of Schrijver's book 
[Sch99| .) it should not be surprising that the max flow min cut theorem is 
relevant to our problem. 

We apply Theorem 1.1 on page 38 of Ford and Fulkerson's book [FF62J We 
take the sets R and S there to be empty and the set T there to be our set 
S. The set X there is our So- The function / there is our cooling function 
c. The function b there is our ho- The function c there is our norm n = \c\. 
The conclusion is that the real number n is the norm of a cooling function 
c on S rel ho if and only if H(So) < n\dE(So)\ for every So C S. Thus 
N = maxs oC s FR(So)- This is statement 2 of the cooling theorem. 

4. The modified simplex algorithm 

The previous section shows that the problem of finding an optimal cooling 
function is a linear programming problem. As such, it can be solved by the 
simplex method. In our situation every step of the simplex method proceeds 
as follows. We have a cooling function c, and the simplex method finds the 
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vertices at which c is not optimal. One chooses one of these vertices, and the 
step concludes with a computation which improves c at the chosen vertex. 
This algorithm has the defects that at every step the norm of c usually does 
not decrease, there is this choice of vertex, and poor choices might lead to 
an infinite loop, although a further enhancement of the algorithm can avoid 
infinite loops. 

The algorithm which we present in this section is a greedy algorithm in that 
at every step we improve our cooling function c at every vertex at which c is 
not optimal. Because we deal with every bad vertex, the norm of c decreases 
at every step. In our algorithm the numerator of \c\ is always bounded by 
\S\ and the denominator of |c| is bounded by |.E(S)|. Since |c| decreases 
at every step, the number of steps is therefore bounded by \S\ \E(S)\. The 
number of operations per step is linear in Thus the number of 

operations required for our algorithm to find the optimal cooling function 
is cubic in \E(S)\. In practice it seems to be quadratic. The steps in our 
algorithm are far more complicated than the steps of the simplex method, 
but there are far fewer of them. In our limited experience, our algorithm is 
faster. Moreover, the subset Sq of S with maximal F0lner ratio which our 
algorithm finds also has the property (See Theorem 14.11 and the discussion 
immediately preceding it.) that if T is a subset of S with maximal F0lner 
ratio, then T C So. 

Here is our modified simplex algorithm. 

Let r be a locally finite graph, as usual. The edges of T are initially undi- 
rected. However, we often direct the edges of T, choosing directions to suit 
the occasion. If e is a directed edge of T, then, as usual, we let i(e) denote 
the initial vertex of e, and we let i(e) denote the terminal vertex of e. 

Now we fix a nonempty finite set S of vertices of T, as usual. We direct the 
edges of E(S) so that every edge of dE(S) is directed away from S. Let h 
be a nonnegative real number. A cooling function for S relative to h is a 
function c : E(S) — > R such that 



We refer to this inequality as the vertex condition or vertex inequality at v. 

The absolute cooling theorem deals with the case in which h = 1. It states 
that S admits a cooling function relative to h = 1 of minimum norm, and 

this norm is N = maxFR(Sr\). In other words, there exists a maximum 

SoCS 

value of h such that S admits a cooling function relative to h with norm 




for every v G S. 



i(e)=v 



t{e)=v 



AMENABILITY, F0LNER RATIOS, AND COOLING FUNCTIONS 



15 



1, and this maximum value of h is N 1 = m.inFR(So) 1 . In this way we 

SoCS 

are led to the problem of maximizing h over all cooling functions for S with 
norm 1. This is what our modified simplex algorithm does: it maximizes h 
over all cooling functions for S with norm 1. For every edge e G E(S), we 
refer to the inequality \c(e)\ < 1 as the edge condition or edge inequality at 
e. 

Our modified simplex algorithm proceeds in steps, starting with step 0. 
After n steps we have the following. We have a forest F n , which is a subgraph 
of r. The vertex set of F n equals S. Every connected component of F n is 
a rooted tree. We direct every edge of F n down toward the root of its 
component. We also have a set R n C E(S) \ dE(S). No element of R n is 
an edge of F n , and every element of R n is directed. We direct the edges of 
E(S) compatibly with F n , R n and dE(S). We also have a nonnegative real 
number h n . A cooling function for (S, F n , R n , h n ) is a function c : E(S) —> R 
satisfying the following conditions. 

(1) c(e) = 1 if e G R n U dE(S) 

(2) c(e) = if e is neither an edge of F n nor in R n U dE(S) 

( 3 ) J2i( e )=v c ( e ) ~~ Et(e)=» c ( e ) = for ever y element v of 5 which is 
not a root of F n 

( 4 ) 12i(e)=v c ( e ) " J2t(e)=v c ( e ) > h n for every root v of F n 

In addition to the above, we have a cooling function c n for (S, F n , R n ,h n ) 
with norm 1 which never takes the value —1. Finally, h n is maximal with re- 
spect to the property that there exists a cooling function for (5, F n , R n , h n ) 
with norm 1. Given such a quadruple (F n , R n ,h n ,c n ), the algorithm con- 
structs another one (F n +i,R n+ i,h n +i,c n +i) with h n+ \ > h n unless the al- 
gorithm finds a subset of S with maximum Folner ratio. This eventually 
occurs, the algorithm finds a subset of S with maximum Folner ratio /i" 1 
together with an associated cooling function and the algorithm stops. 

The situation at step is as simple as possible. The forest Fq has no edges. 
Its connected components, the elements of S, are trivial rooted trees. The 
set i?o is empty. The cooling function cq takes the value 1 on dE(S) and 
the value elsewhere. The maximality of ho then implies that 

ho = min|{e G dE(S) : i(e) = v}\. 
All conditions are satisfied. 

Now let n be a nonnegative integer, and suppose that we have F n , R n , h n 
and c n as above. We prepare to construct F n+ ±, Rn+i, h n+ \ and c n+ \ in the 
next paragraph. 
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Suppose that the edge inequality satisfied by c n is strict for every edge of 
F n . Let T be a nontrivial connected component of F n , and let v be a leaf 
of T. (Roots are not leaves.) Because the edge inequality satisfied by c n 
at the edge e of T which contains v is strict, it is possible to send slightly 
more than c n (e) units of heat toward the root of T along e while satisfying 
the vertex equality at v for a value slightly larger than h n . We modify c n in 
this way at every edge of T to satisfy the vertex equality at every vertex of 
T other than its root for a real number slightly larger than h n . This applies 
to every such connected component of F n . Thus because h n is maximal, 
the vertex inequality satisfied by c n for h n at some root of F n is actually an 
equality. We conclude that either the edge inequality satisfied by c n at some 
edge of F n is actually an equality or the vertex inequality satisfied by c n for 
h n at some root of F n is actually an equality. We observe that because c n 
never takes the value —1, if e is an edge of F n , then |c n (e)| = 1 if and only 
if c n (e) = 1. 

We digress briefly in this paragraph to show that /i" 1 is a relative F0lner 
ratio. According to the previous paragraph there exists either a root or an 
edge of F n such that the inequality satisfied by c n at either this root or edge 
is actually an equality. If this equality holds at a root, then let v be this 
root, and if this equality holds at an edge, then let v be the upper vertex 
of this edge. Let So be the set of those vertices in S which are either equal 
to or above v relative to F n . By assumption the vertex inequality satisfied 
by c n for h n is actually an equality at every element of Sq other than v , 
and now this is even true at v. We combine these equations, one for every 
vertex of Sq. We use the facts that c n (e) = 1 for every edge e 6 R n U dE(S) 
and c n (e) = if e is neither an edge of F n nor in R n U dE(S). We also use 
the fact that if v is not a root, then the edge of F n immediately below it is 
directed away from So and c n has value 1 at it. Letting d'E(So) denote the 
set of edges of either dE(S) or F n which contain exactly one element of So, 
we obtain that 



\d'E(S )\+ E X - E 1 = l* 

e€R„ e€i?n 
i(e)eS t(e)eSb 

Hence h n = i?Fi?(S'o)~ 1 , where 



RFR(So) = \S \ 



\ 



e&R n 
i(e)eS 



\ 



\d'E(S )\+ E X - E 1 



e€R n 
t(e)€So ) 



We view RFR(So) as a relative F0lner ratio. It is the F0lner ratio of So 
relative to S, F n and i? n . 
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We return to the result of the penultimate paragraph. Either the edge 
inequality satisfied by c n at some edge of F n is actually an equality or the 
vertex inequality satisfied by c n for h n at some root of F n is actually an 
equality. We find every such root and edge. Every such root loses its status 
as root. Every such edge, but not its vertices, is removed from F n . The 
result is a forest, which we denote by F' n . Every edge just removed from 
F n , directed as in F n , is added to R n . The result is a set of directed edges, 
which we denote by R' n . The forest F' n has connected components without 
roots. We now construct F n+ \ by inductively enlarging the rooted connected 
components of F' n . We adjoin certain edges to F' n to construct a maximal 
forest with at most one root per connected component. The edges which 
we adjoin are edges of E(S) \ dE(S) which are either not in R' n or directed 
away from the rooted connected components. So, if possible we adjoin such 
an edge to F^, joining one of its rooted connected components with one of 
its unrooted connected components. The result is a forest, one of whose 
rooted connected components contains two connected components of F' n . If 
possible we adjoin such an edge to this new forest, joining one of its rooted 
connected components with one of its unrooted connected components. We 
continue in this way as long as possible. This final forest is F n+ \. 

Suppose that F n+ \ has a connected component which is not rooted (which 
occurs if F' n has no roots). Let Sq be the set of elements of S which are 
not contained in rooted connected components of F n+ \. Then every edge in 
dE(S$) is either in dE{S) or it is in R' n and directed away from Sq. Hence 
c «( e ) = 1 f° r every e £ dE(So). It follows that c n is removing as much heat 
from So as is possible while maintaining norm 1. As in the paragraph which 
discusses relative F0lner ratios, there is no cooling function for S with norm 
1 relative to a value larger than h n = RFR(So)^ 1 = FR^Sq) -1 , where this 
relative F0lner ratio is computed relative to R' n . In this case So is a subset 
of S with maximum F0lner ratio, and the algorithm stops. 

Suppose that the algorithm does not stop at step n. Then every connected 
component of F n+ \ is rooted. We define R n +\ to be the set of directed 
edges gotten from R' n by deleting those edges of R' n which became edges 
of F n+ \. If v is a root of F n at which the inequality satisfied by c n for 
h n is an equality, then v is not a root of F n+ ±. If e is an edge of F n 
at which the inequality satisfied by c n is an equality, that is, c n (e) = 1, 
then either e is not an edge of F n+ \ or it is directed away from the root 
of its connected component. The cooling function c n need not be a cool- 
ing function for (S, F n+ ±, R n +i, h n ) mostly because some edge directions 
might have changed. By simply changing some signs if necessary, c n de- 
termines a cooling function c' n for (S, F n+ i, R n+ \, h n ) except for possibly 
taking the value —1 at some edges. The value of c' n at every edge of F n+ \ 
is strictly less than 1. As at the beginning of the passage from step n to 
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step n + 1, this implies that by appropriately increasing every value of c' n 
by a positive amount we obtain a cooling function for (S, F n+ i, R n +i, h n ) 
which satisfies every conditional inequality for a real number strictly larger 
than h n . Because the vertex inequality for c' n is an equality for every ver- 
tex of F n+ \ which is not a root, every value of every cooling function for 
(S, F n+ i, Rn+i, h' n ) with h' n > h n is at least as large as the corresponding 
value of c' n . Maximizing, we obtain a real number /i ra +i and a cooling func- 
tion c n+ i for (S, F n+ i, R n+ i, h n+ i) with norm 1 which never takes the value 
— 1 such that h n+ \ is maximal with respect to the property that there ex- 
ists a cooling function for (S, F n+ \, Rn+i, /i n +i) with norm 1. The quadruple 
(F n+ i, Rn+i, h n+ \,c n+ i) satisfies all required conditions. This completes the 
description of the algorithm. 

In this paragraph we show that this algorithm ends with a solution after 
finitely many steps. We have seen that h~ x is a relative F0lner ratio for every 
n. Hence the numerator of h n is bounded by |£ ; (5')| and the denominator of 
h n is bounded by \S\. There are only finitely many possibilities for h n . Since 
the sequence ho, h±, hi, ■ ■ ■ is strictly increasing, it follows that the algorithm 
ends after finitely many steps. 

The proof of the following theorem is based on this modified simplex al- 
gorithm. By comparing the proof and the algorithm, one sees that the 
algorithm finds the subset So of the theorem. 

Theorem 4.1. The set S contains a maximal F0lner- optimal subset So, that 
is, a subset So with maximum F0lner ratio, such that every F0lner- optimal 
subset of S is contained in So- 

Proof. After choosing directions for the edges of E(S), the absolute cooling 
theorem implies that there exists a cooling function c : E(S) —> R with norm 
1 rel ho, where ho is the inverse of the maximum F0lner ratio of a subset 
of S. Without loss of generality we assume that if e G dE(S), then e is 
directed away from S and that c(e) = 1. 

Let T be a F0lner-optimal subset of S. So h^ 1 = FR(T). We direct every 
edge of dE(T) away from T. By definition c satisfies the vertex inequality, 

i(e)=v t(e)=v 

for every v G S, and so this holds for every v G T. We combine these vertex 
inequalities, one for every element of T, and obtain 

E c{e)>\T\ho = \dE{T)\. 

e£dE(T) 
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Since |c(e)| < 1 for every e G dE(T) and the number of these summands 
equals \dE(T)\, we can make two conclusions. One is that every vertex 
inequality for T is actually an equality. The other is that c(e) = 1 for every 
e G dE(T). 

We use these two conclusions to construct the maximal subset of S with 
maximum F0lner ratio. For this we define a (nonreflexive, nonsymmetric 
and nontransitive) relation on S as follows. Given u, v G S, we have that 
u ~ v if and only if there exists e G E(S) directed so that t(e) = u, i(e) = v 
and c(e) < 1. (The condition c(e) < 1 is equivalent to c(e) ^ 1.) Here 
we view c as being defined not only for every edge e G -E'(S') with its given 
direction but also for the opposite edge e, so that c(e) = — c(e). 

Let X be the set of vertices of 5 whose associated vertex inequality is strict. 
Let X be the closure of X under the above relation. Conclusion one from 
above implies that every subset of S with maximum F0lner ratio is contained 
in the complement of X. Conclusion two then implies that every subset of 
S with maximum F0lner ratio is contained in the complement of X . 

Now let Sq be the complement of X. To prove the theorem it suffices to 
prove that FR(S ) = h^ 1 , the maximum F0lner ratio of a subset of S. For 
this we first observe that every vertex inequality is actually an equality for 
every element of So. Furthermore, if e G dE(So) is directed away from So, 
then c(e) = 1. Combining the vertex equalities associated to So as in the 
second paragraph of this proof, we find that \dE(So)\ = \So\ho- This implies 
that FR(S ) = h^ 1 . 

This proves Theorem 14.11 □ 

5. The relative cooling theorem 

Our plan for the rest of the paper is to construct cooling functions by finding 
a F0lner-optimal subset So C S, building an optimal cooling function Co on 
So, and then building cooling functions on larger and larger subsets of S, 
extending cq layer by layer. A cooling function on a smaller subset pumps 
heat into the layers not yet considered. Thus we need to know when the 
extension is possible. This problem is dealt with by means of the relative 
cooling theorem, which is a corollary to the cooling theorem of Section [2j 
The only twist in the argument is that we do not apply the theorem to a 
subgraph of Y but rather to a slightly modified subgraph. 

Setting: We assume V, G, S, E(S), and dE(S) given, with edges e G E(S) 
oriented as before. 
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The modified graph T' s and modified vertex set S': We form a new 
graph T' s from the graph Ts = U{e | e G E(S)} by splitting all edges e G 
dE(S) apart at any common vertex in the complement of S so that the 
boundary edges have distinct terminal vertices in the complement of 5. Let 
S' be a subset of the set of vertices of T' s with S C S'. 

Initial conditions: We assume given a function ho : S' — > (0, oo). 

We assume: S' admits a cooling function rel ho- 

Lemma 12.11 implies that S' admits a cooling function rel ho if and only if 
every connected component of S' has a nonempty boundary. 

We immediately obtain the following relative cooling theorem, which is sim- 
ply the general cooling theorem of Section [2] applied to our modified graph 

1 s- 

Theorem 5.1 (Relative cooling). (1) If N is the minimum possible norm of 
a cooling function rel ho, then the reciprocal 1/N is given by the minimum 
value of a convex function A(f) on the simplex A(S") as follows: 

1/N = min A(f), 

where f is the modification of f defined above and 

A(f)= £ |/(<(e))-/(t(e))|. 

(2) The minimum possible norm N is given by the maximum F0lner ratio: 

N = max FR(So). 

SoCS' 

(3) If FR(So) realizes the maximum in (2), then the function f G A(S") 
whose modification f is constant on So and on S'\So realizes the minimum 
in (1). 



We shall apply this relative cooling theorem in Section [6l 



6. Peelings 



We assume V, G, S, E{S), dE(S), and ho ■ S — > (0,oo) given as before. 
Our goal is to find a F0lner-optimal subset So C S. We propose to do so by 
peeling layers away from S until we find the desired set. 
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Let P be a nonempty subset of S, and let T = S \ P. We have the (split) 
graph T'p from the previous section. Let P' be the subset of T' p consisting 
of P together with the boundary vertices of T' p arising from (the splitting 
of) points of T. If Pq C P', then we let B(Pq) denote the cardinality of 
Pq\P and we let B'(Pq) denote the number of edges in T' p having exactly 
one vertex in Pq. [Note that if x 6 Pq \ P is not joined by its unique edge 
e in T'p to a point of Pq, then x is counted in B(Pq) and e is counted in 
B'(Pq).} We call P a peeling for S if 

vp c p', h(p n P) < fr(t)(b'(Pq) - b(Pq)). 

The next lemma gives an alternate characterization of peelings. 

Lemma 6.1. Let N be a positive real number. In the above setting, we 
extend Hq\p : P — > (0,oo) to P' by defining ho(x) = N for each x E P'\P. 
Then FR{P ) < N if and only ifH(P nP) < N(B'(P ) - B(P )) for every 
Pq C P' . In particular, taking N = FR(T) shows that P is a peeling for S 
if and only if FR(P ) < FR(T) for every P C P' . 

Proof. Let P C P' . Then 

Fn(p) _ N.B(P ) + H(P nP) 

FR{Po) ~ nPo] • 

So FR(P ) < N if and only if 

N • B(P ) + H(P HP)<N- B'(P ) 

if and only if 

H(P Q nP)<N(B'(P )-B(P )). 
This proves Lemma 16.11 □ 



The next lemma gives a basic property of peelings. 

Lemma 6.2. If P is a peeling for S, then every connected component of 
P' C T'p has a nonempty boundary. 

Proof. Let Pq be a nonempty subset of P' . Prom the definition of peeling, 

H(P HP)< FR(T)(B'(P ) - B(P )). 

If B{P ) = 0, then P C P, and so H(P n P) / 0. It easily follows that 
B'(Pq) is positive for every nonempty subset Po of P'. In particular, it is 
positive if Po is a connected component of P' . So some edge of T' p contains 
exactly one vertex of Pq . This proves Lemma 16.21 □ 



Peelings are important for two reasons: 
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Theorem 6.3. If P is a peeling for S = PUT, then every cooling function 
for T rel ho can be extended to a cooling function for S rel ho without 
increasing the cooling norm. 

Theorem 6.4. If T is a proper subset of S which is F0lner- optimal, then 
S \ T is a peeling for S. 

These two theorems apply as follows: A subset S can be peeled until a subset 
So is obtained that admits no peeling. When that happens, Theorem 16.41 
implies that FR(So) is as large as the F0lner ratio of any of its subsets. 
Since S admits a cooling function rel ho, so does So- Hence, by the cooling 
theorem, the set Sq admits a cooling function Co of cooling norm FR(Sq). 
By Theorem 16.31 this cooling function extends to a cooling function for S 
of norm FR(So)- We conclude that FR(So) maximizes the F0lner ratio of 
subsets of S, that FR(So) is the minimal cooling norm for S, and that the 
extension of cq is an optimal cooling function for S. The following problems 
remain: 

Problems. Give an efficient algorithm for determining when a peeling 
exists. Give an efficient algorithm for finding a peeling when a peeling 
exists. 

Proof Ii6. 3\) . Suppose we are given a cooling function for T rel ho- We can 
modify this function so that it is constant on all boundary edges of T and 
equal to the norm TV" of the function on those edges. It suffices to show that 
this modified cooling function extends to a cooling function for S rel ho, for 
returning the values of the extension on the boundary edges of T to their 
original values will not destroy the cooling properties of the extension. 

From this point, we concentrate on P' C T' p . We extend ho\p '■ P -4 (0, oo) 
to P' by defining ho(x) = N for each x E P'\P. Lemmas 16 . 21 and 12 . 1 1 combine 
to show that P' admits a cooling function rel ho- Hence the relative cooling 
theorem applies to this situation. It implies that the smallest possible cooling 
norm for P' rel ho is M = m&x{FR(Po) \ Po C P'}. Lemma 16.11 implies 
that M < FR(T) < N. Hence a cooling function for P' with norm M 
provides an extension of the original cooling function for T. This proves 
Theorem 16.31 □ 

Proof ffi.^| j. Let T be a proper subset of S whose F0lner ratio N = FR(T) 
is maximal among all subsets of S. Let P = S \ T, a nonempty subset of 
S. We prove that P is a peeling for S. Indeed, statement 2 of the cooling 
theorem implies that there exists a cooling function c : E(S) — > 1R for S rel 
ho of norm N. This function also cools T rel ho- With the edges of dE(T) 
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oriented away from T, the values of c on them must be constant with value 
N, for, if some boundary value carries less than the average N required of 
the boundary edges of T, then another would have to carry more than the 
average, a contradiction. 

We return to T' p as in the proof of Theorem 16.31 We again extend ho\p '■ 
P — > (0, oo) to P' by defining ho(x) = N for each x G P' \ P. The edges of 
E(P) are in canonical bijective correspondence with the edges of T' p , and so 
c determines a function d from the edges of T' p to M. It is clear that d cools 
P viewed as a subset of F' p , and c' also cools P' \ P because every edge of 
dE(T) carries N units of heat away from T. So c' is a cooling function rel 
ho with norm N. Now we apply statement 2 of the relative cooling theorem 
to conclude that FR(P ) <N = FR(T) for every P C P' . Now Lemma[6J] 
implies that P is a peeling for S. □ 



7. Building optimal cooling functions for Z © Z 



Our eventual goal (unfortunately, only partially completed in this paper) 
is to find an efficient algorithm to find a subset Sq <Z S having maximum 
F0lner ratio FR{Sq). The steps proposed are these: 

(1) By peeling, find a subset T C S that allows no further peeling (or at 
least seems to allow no further peeling). 

(2) Prove that T is self-optimal (FR(T) = max{FR(T ) \ T C T}) by build- 
ing a cooling function for T of norm FR(T). 

(3) Extend the cooling function on T to a cooling function on S without 
increasing the cooling norm. 

The relative cooling theorem, as applied in Theorem 16.31 completes step 3. 

In the remainder of this section, we show how these steps can be carried 
out for the n-ball B(n) in the free Abelian group Z © Z with its standard 
two- generator generating set. This ball is shaped like a diamond. Asymp- 
totically, the F0mer-optimal subset of B(n) which we find is shaped like a 
regular octagon (stop sign). To guess the optimal shape, we simply look at 
successive F0lner ratios as layers are removed at the corners: right, left, top, 
and bottom. Our first task is to determine when one of these edge layers is 
in fact a peeling. (Recall Section [H) The theorems that follow will imply 
that, in these special cases, we need only compare a single fraction m/2 with 
the F0lner ratio of the set before the deletion. 
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Setting: We consider the Cayley graph r of G = Z 2 with standard gener- 
ators and initial condition ho = 1. We consider TcZ 2 finite, T a subset 
of the lower open half-plane and containing ([1, m] x {—1}) n Z 2 for some 
positive integer m. We consider P = ([1, m] x {0})nZ 2 . We ask whether P 
is a peeling for PUT. We return to P' C Tp as in Section [6j 

Theorem 7.1. (%) //P C P'\P, thenH(P nP) = and B' (P ) - B (P ) = 
0. 

("HJ //P C P' and P <£ P'\P, then B'(P )-B(P ) > and f/ie maximum 

H(P n P) 

P'(Po) - P(P ) 
is realized by the set Pq = P' with value m/2. 

Corollary 7.2. The set P is a peeling for S = T U P if and only if m/2 < 
FR(T). 

Remark. Of course, the search for a peeling P begins not with T but with 
S = T U P. We shall show later that this necessary inequality is equivalent 
to the inequality with FR(S) replacing FR(T) so that, as P changes, we 
need not calculate the F0lner ratio of the new T but can instead retain the 
previously calculated old ratio FR(S). 



Proof (Theorem 7.1). Statement 1 and the inequality B [Pq) — B(Pq) > 



in statement 2 are easy to check. So let Pq C P 1 with Pq <f_ P'\ P. We alter 
Pq by successive moves that can only increase the fraction being maximized. 

There is a canonical injective graph morphism from T' p to T, which we use 
to identify T' p with a subgraph of T. 

Without changing B(Pq) and H{Pq n P), we may permute the elements of 
P in such a way that all elements of Pq n P appear consecutively beginning 
at (1,0). This can only decrease B'(Pq) and thereby increase the fraction 
being maximized. 

If (a, 0) £ Pq, then we may assume that (a, —1) G Pq, for the insertion of 
(a, —1) into Pq will increase B(Pq) by 1, will decrease B'(Pq) by 1, and will 
leave P^(Po H P) unchanged. 

If (a, —1) G Pq but (a, 0) ^ Pq, then we may assume such points appear just 
after the pairs described in the paragraphs above. Beginning with the left- 
most exemplar, we insert (a, 0) into Pq. This increases H by 1 and leaves 
B(Pq) unchanged. Because Pq <f. P' \ P, it also leaves B'(Pq) unchanged. 
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If some (a,0) G P does not appear in Pq, then beginning with the leftmost 
such a, we insert both (a, 0) and (a, —1). This increases H by 1, -B(-Po) by 
1, and B'{P Q ) by 1. 

This easily proves Theorem 17.11 □ 



We proceed to the improvement that allows us to substitute FR(S) for 
FR(T). Let n be a nonnegative integer, and let k S {0, ...,n}. We let 
B(n,k) denote the set remaining after the first k horizontal layers are re- 
moved from B(n) both from top and bottom and the first k vertical layers 
are removed from B{n) from both the left and right. The set B(n, k) is an 
octagon until k reaches [n/2], at which point B(n,k) becomes a square. 

It is helpful to allow the notation * € {<, =, >} for the next two theorems. 

Theorem 7.3. For each * £ {<, =, >} and for each k with < k < [n/2] , 

FR(B(n, k + 1)) * FR(B(n, k)) iff FR(B(n, k + 1)) * (2k + l)/2 

iff FR(B(n, k)) * (2k + l)/2. 



Proof. The set B(n,k + 1) is gotten from B(n,k) by removing four layers 
from B(n, k). Removing one layer removes 2k + 1 vertices and decreases the 
number of boundary edges by 2. So B(n, k) has x = 4(2k + 1) more elements 
than B(n, k + 1) and y = 8 more boundary edges. Hence if FR(B(n, k)) = 
7fc/(5 fe , then 

Ik Ik+i + x 7fc + i 7^ - x 

— = and = . 

Ok o k+1 + y d k+1 Ok~y 

A bit of algebra now yields that 

— * — lit — * — in — * — . 

dk+i Ok Ok+i y h y 

This proves Theorem 17.31 □ 
Theorem 7.4. Let k^ be the smallest integer greater than 

(2n - V2n 2 + 2n+ l)/2. 

ThenO < ko < n/2 and FR(B(n,k)) increases monotonically on the interval 
< k < ko and decreases monotonically on the interval ko < k < [n/2] . 

Proof. It is an easy matter to calculate FR(B(n,k)): 

, , 2n 2 + 2n + 1 - Ak 2 
FR(B(n, k)) = i- . 

Thus 

2k -i- 1 

FR(B(n,k))*——- 
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if and only if 

2n 2 + 2n + 1 - Ak 2 * 8nk + Ak - 8k 2 + An + 2 - 4k 
if and only if 

Ak 2 + (-8n)k + (2n 2 - 2n - 1) * 0. 
Set p(x) = Ax 2 — 8nx + 2n 2 — 2n — 1. The smaller root of is 

8n - A/64n2 - 16(2n 2 - 2n - 1) _ 8n - V32n 2 + 32n + 16 

8 ~ 8 

_ 2n - V2n 2 + 2n + 1 

~~ 2 

Since the larger root of p(x) is positive and p{— 1) > 0, it follows that 
&o > 0. One verifies that p{^^) < 0. If n is odd, then this implies that 
ko < (n — l)/2 < n/2 and if n is even, then ko < n/2. Thus < ko < n/2. 
Moreover, from the last display we conclude that FR(B(n, k)) > (2A; + l)/2 
for k < k and FR(B(n,k)) < (2k + l)/2 for k < k < [n/2]. (Note that 
since every term in p(a?) is even except the last, p(x) has no integer roots.) 
Therefore the desired result follows from Theorem 17.31 □ 

Theorem 7.5. For every integer k with < k < ko, the set B(n,k) \ 
B(n, k + 1) is a peeling for B(n, k). 



Proof. Theorem l7.4l implies that FR(B(n, k)) increases monotonically on the 
interval < k < k$. This and Theorem 17.31 imply that FR(B(n,k + 1)) > 
(2/c + l)/2 if < k < ko. Let P = B(n,k)\B(n,k + l). Because k < n/2, it 
follows that the graph T' p is the disjoint union of four graphs, one for each 
of the four layers in P. Theorem 17.11 applies to each of these four layers, and 
we see that its conclusion holds even for P. As in Corollary 17. 2} it follows 
that P is a peeling for B(n, k) if and only if m/2 < FR(B(n, k + 1)), where 
m = 2k + 1. This completes the proof of Theorem 17.51 □ 

Theorem 7.6. The set B(n,ko) is asymptotically a regular octagon. 



Proof. The set is clearly an octagon with two horizontal sides of lengths 
approximately 2/co, two vertical sides of lengths approximately 2ko, and four 
diagonal sides at angles of forty-five degrees from the axes, each having (as 
yet unknown) approximate length i. We want to show that i ~ 2k$. Each 
diagonal side is the hypotenuse of an isosceles right triangle whose legs are 
approximately of length n — 2k§. We calculate: 

n _ 2n _ 2 2 

k ~ 2n - V2n 2 + 2n + 1 ~ 2- ^2 + 2/n+l/n 2 ~ 2 - y/2~' 

Thus, 

' V2 ( "- 2t °>~V2(^-2W 



\2-\/2 

We conclude that the octagon is almost regular for large values of n. □ 
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Theorem 7.7. Let ko be the smallest integer greater than 

(2n - \/2n 2 + 2n + l)/2. 

T/ten B(n,ko) is a F0lner- optimal subset of B(n). That is, the F0lner ratio 
of B(n,ko) is the maximal F0lner ratio of subsets of B(n). Every optimal 
cooling function on B(n,ko) can be extended to a cooling function on B(n) 
without increasing the cooling norm. 

Remark. The following are the major steps in the proof. 

(1) The set B(n, |~n/2~|) is a square and is self-optimal because every xy- 
rectangle in Z Z is self-optimal. 

(2) If ko < k < [n/2] and B(n,k + 1) is self-optimal, then B(n,k) is also 
self-optimal. 

(3) If < k < ko, then B(n, k) \ B(n, k + 1) is a peeling for B(n, k). Hence 
each cooling function on B(n,k + 1) can be extended to a cooling function 
on B(n, k) without increasing the cooling norm. 

Steps (1) and (2) will show that B(n,ko) is self-optimal. Step (3) will then 
show that B(n,ko) is a F0lner- optimal subset of B{n). 

Step (3) is Theorem \ 7.5\ Step (1) is easy and will be carried out next. 
Step (2) requires a fair amount of further work which we shall carry out 
after Step (1). 

For both Step (1) and Step (2) of the proof, we will manipulate what we 
call difference diagrams. For the first time in this paper we will make use of 
the fact that we are considering the Cayley graph V = T{G, C) of an infinite 
group G with finite, torsion-free generating set C. If S is a finite subset of 
the set of vertices G of T, and if / : E(S) — > R is a cooling function, then 
we associate with / a difference diagram D as follows: 

D:SxC->l 

D(s,c) = f((s,c,sc)) - f((sc-\c,s)). 

That is, D(s,c) gives the net heat loss at s along the orbit of G defined by 
the group generator c. The difference diagram is a refinement of the heat 
loss function at a vertex that we used in previous sections. We call D a 
diagram since its values can be recorded at the vertices of the graph T in a 
pictorial diagram. In order to make the relationship completely clear with 
the heat loss function h(s) defined earlier, we note that 

Hs)= Y, /( e )" E f(e) = Y. D ^ c )' 
i(e)=s t(e)=s ceC 
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We need only this simple difference diagram in carrying out Step (1) of 
the proof. However, in Step (2) we shall consider the difference between 
difference diagrams defined on two different sets, one containing the other. 
In order to do this, we shall have to extend by the difference diagram on 
the smaller set so that the two difference diagrams have the same domain of 
definition. This new difference will be called a diagram of second differences. 

Theorem 7.8. The cooling function f is determined by its difference dia- 
gram and the values of f on dE(S). 

Corollary 7.9. // S is self-optimal and f : E(S) — >• M is a cooling function 
of norm FR(S), then f is determined by its difference diagram. 



Proof |7.<g| and 7.9). The corollary follows immediately from the theorem 
since, in the case of self-optimal sets S, the boundary values of optimal 
cooling functions are constant and equal to FR(S). 

In general, since every generator c E C has infinite order and the set S is 
finite, every c-orbit determines edges e%, e2, ... ,e n , with the initial vertex 
i(ei) of e\ and the terminal vertex t(e n ) of e n lying in the complement of 
S, all other vertices lying in S and t(ej) = i{ei)c for i G {1, . . . ,n}. The 
edge e\ is a boundary edge so that f[e\) is given. Inductively, /(e^+i) = 
f(ei) + D(t(ei),c). □ 

Remark ((Important!)). In the self-optimal case, it is important to real- 
ize that heat loss is always outwards at the boundary. This means that in 
the previous paragraph, the first value f{e{) = —FR(S) must be thought of 
as a negative number and the terminal value f(e n ) = FR(S) as a positive 
value, with intermediate values /(ej) increasing, not necessarily monotoni- 
cally, from the one value to the other. It is important that in its possibly 
oscillatory traverse from the negative value to the positive value, the values 
/(e,) never stray from the interval [—FR(S),FR(S)]. In terms of the dif- 
ference diagram, this means that the partial sums along initial segments of 
the orbit remain in the interval [0,2 • FR(S)], with the total sum equalling 
2-FR(S). 

In the simplest cases, along each orbit the difference diagram exhibits a par- 
tition of [0, 2 • FR(S)] ; that is, the values of D are positive and sum to 
2 ■ FR(S). However, a partition does not always suffice. When finding a 
F0lner- optimal subset of B(n), we shall often have to employ an oscillatory 
traverse along certain orbits with D exhibiting both positive and negative 
values. The first case where this seems to be necessary is -8(8). 

Theorem 7.10. feG = Z©Z = (a, b\ a6a _1 6 _1 = l), every xy-rectangle 
R is self- optimal. 
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Proof. Suppose R denotes the xy-rectangle ([1, m] x [l,ji])nZ 2 . Then 

„„, m ■ n 

FR(R) = — -. 

v ; 2(m + n) 

We express our cooling function in terms of the unit l/(2(m + n)) so that 
our difference diagram can have integral entries. Boundary values are thus 
given (in terms of the prescribed units) by ±m-n and the difference diagram 
must have initial partial sums that traverse from to 2 ■ m ■ n, never leaving 
the interval [0, 2 • m ■ n]. The negative sign is used near initial points of 
orbits, the positive sign near terminal points of orbits. A difference diagram 
is given by the function 

D{{x, y), a) = 2 • n and D{{x,y),b) = 2 • m. 

Since D{{x, y), a) + D((x, y),b) = 2 - (m + n), each vertex is cooled by exactly 
2(m + n) fractional units, hence by exactly 1 real unit. Along a-orbits and 
6-orbits, the difference diagram partitions 2 • m ■ n. Hence, the cooling norm 
is FR(R). □ 

Theorem 7.11. The set B{n,k) is self-optimal for all integers k in the 
interval [ko, [n/2]]. 



Proof. Since the rcy-square B(n, \n/2]) is self-optimal by the previous the- 
orem, it is clearly enough to prove that, if [n/2] > k > ko and B{n,k + 1) 
is self-optimal, then B(n,k) is also self-optimal. Let k be an integer in the 
interval [ko, [n/2] — 1]. 



We introduce B(n,k + l,i), for i G {0,1,2,3,4}, as the set formed from 
B(n,k + 1) by adding i of the layers of B(n,k) not in B(n,k + 1). We 
let FR(B(n,k + l,i) = cti/fii. It suffices to show that, if B(n,k + l,i) is 
self-optimal, then B(n,k + l,i + 1) is self-optimal, for i G {0,1,2,3}. We 
shall need the inequalities given in the following lemma. 

Lemma 7.12. If j G {0,1,2,3,4}, then 

{2k + l)(/3j - 2{2k + 1)) < 2 aj < {2k + 



Proof. To prove the second inequality, suppose that FR(B{n, k)) = 7fc/5fc, 
as in the proof of Theorem 17.31 Because ko < k < — 1, Theorems 17.31 
and 17.41 combine to imply that jk/^k < (2/c + l)/2. Hence 

2aj = 2{ lk +j{2k + 1)) < {2k + l){5 k + 2j) = {2k + 

This proves the second inequality. 



We prove the first inequality first for the case j = 4, where B{n, k + 1,4) 
B{n, k). The inequality is then equivalent to the inequality 

{2k + l)(/3 4 - 2(2A: + 1)) < 2a 4 . 
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Substituting known values for 04 = \B(n,k)\ and ^4 = \dE(B(n, k))\, as in 
the proof of Theorem 17.41 we seek to prove that 

(2k + l)(8n + 4 - 8k - 2(2k + 1)) < 2(2n 2 + 2n + 1 - 4k 2 ). 

This reduces to 

4/c 2 + (2 - An)k + (n 2 - n) > 0, 

equivalently, 

(2k-n)(2k- (n - 1)) > 0. 
Since fe < |~n/2] — 1 < (n — l)/2, the proof is complete for j = 4. 

Now we induct downward for j = 3, 2, 1, 0. In the original inequality, before 
manipulation, each move downward subtracts exactly 2(2k + 1) from each 
side of the inequality. This completes the proof. □ 

Assuming inductively that B(n, k + 1, 1) is self-optimal, we let D denote the 
difference diagram of some optimal cooling function for B(n,k + l,i). We 
employ as unit and assume inductively that we can choose the entries 
of D to be integers. We extend this difference diagram by to the larger 
set B(n,k + l,i+l)xC. 

We seek a difference diagram D + , expressed in units l//3j + i = l/(/3i+2), that 
will exhibit B(n, k+ 1, i + 1) as self-optimal and ask ourselves the properties 
that must be satisfied by the second difference E = D + — D. Even though 
the two diagrams are expressed in different units, each will have integer 
entries, and we express E as the integer difference of those entries. In other 
words, we manipulate the number of heat loss units required and compare 
the number of units in the two diagrams. 

We label the rows of B = B(n, k + 1, i + 1) by symbols Ro, R±, ... from top 
to bottom and assume that Ro is the new row with 2k + 1 elements. 

We label the columns of B that contain an element of Ro by symbols C±, 
. . ., C2/C+1 from left to right and call these columns the central columns. 

We label the other columns of B by symbols Si, S2, • • • from left to right 
and call these columns the side columns. 

We view the elements of B as cells for holding heat loss tokens. Our task is 
to deposit heat loss tokens in these cells subject to the following conditions. 

(1) Row Ro is assigned 2 • ctj+i tokens. This new row Ro had no heat loss at 
all assigned by the cooling function for B(n,k + 1, i); hence the difference 
function E must account for the total difference 2 • aij + i required along every 
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orbit. All other rows and columns are assigned only 2(2k + 1) tokens. This 
requirement reflects the fact that the old rows and columns require exactly 
2(«j + i — Oj) = 2(2k + 1) more heat loss units than assigned to that row or 
column by the cooling function for B(n, k + 1, i). 

(2) After distribution, each cell of i?o is to have fy+i tokens. At these vertices 
of the new orbit Ro, the total heat loss recorded by E must be since 
there was no heat loss there with the cooling function for B(n,k + 1, i). All 
other cells are to have 2 tokens. This requirement reflects the fact that the 
old vertices require only — (3i = 2 more heat loss units than assigned to 
them by the cooling function for B(n, k + 1, i). 

Heat-loss token distribution in the new row Rq: Dividing, we find 

2-a i+ i = (2k + l)q + r, 

where the quotient q and the remainder r are integers and < r < 2k + 1. 
That q < follows from the second inequality of Lemma 17.121 with j = 
2+1. That Pi + \—q < 2(2fc+l) follows from the first inequality of Lemma l7.12l 
with j = i + 1. 

Thus we may place q of the tokens assigned to the row -Ro hito each cell of 
Ro, with one extra in each of the first r cells. We may then place either 
— q (or /3j + i — q — 1, as appropriate) column tokens into the cells of Rq 
so that the total number of tokens in each cell is A+i, as required. 

Distribution of the heat-loss tokens associated with the side columns: 

In side column Sj, place 2 of the tokens assigned to Sj into each of the bot- 
tom 2k + 1 cells of Sj. 

The critical region: Almost always there will be cells of the side columns 
which still contain no heat loss tokens. These cells will form two triangular 
regions. These two triangular regions together with the portions of the 
central columns at the same height form what we call the critical region. 
See Figure [Q 

Distribution in the central columns beneath the critical region: 

Consider a row Rj that lies beneath the critical region. The cells in that 
row which have not yet been filled with two heat loss tokens are precisely 
those in the central columns. Place 2 of the heat loss tokens assigned to Rj 
into each of those cells. This will use up all of the 2(2fe + 1) heat loss tokens 
assigned to that row. 

Distribution of heat-loss tokens in the critical region: Fill that por- 
tion of the triangular regions in row Rj with 2 tokens assigned to Rj in 
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Figure 1. The set 5(9,4,4) = B(9,3), whose critical re- 
gion is within the shaded polygon. 

each cell. For large n, the number of tokens needed for this distribution 
exceeds the number, 2(2k + 1), of tokens assigned to Rj. For each extra 
token needed, we create a new heat loss token and a compensating negative 
heat loss token. These negative heat loss tokens will be distributed in the 
intersection of the row Rj with the central columns. 

In this paragraph we verify that the partial sums for E along initial segments 
of Rj stay in the range [0, 2 • (2k + 1)]. It will then follow that the partial 
sums along initial segments of Rj for the associated cooling function stay 
in the range [—FR(B(n, k + 1, i + l)),FR(B(n, k + 1, i + 1))]. The number 
of side columns left of the central columns which intersect Rj in the critical 
region is at most n — 2k — 1 . In order for the partial sums for E along initial 
segments of Rj to be at most 2 • (2k + 1), we need n — 2k — 1 < 2k + 1. 
This is equivalent to k > (n — 2) /4. As in the proof of Theorem 17. 4^ we let 
p(x) = Ax 2 — 8nx + 2n 2 — 2n — 1, and we verify that p(n) < and 

fn-2\ (n-2) 2 
p ^_J=L_J_ + 2n-l>0. 

Since k is between the smaller root of p(x) and n, it follows that k > 
(n — 2)/4. This verifies that our partial sums are at most 2 • (2k + 1). 
Symmetry shows that they are nonnegative. Thus the partial sums for E 
along initial segments of Rj stay in the range [0, 2 • (2k + 1)]. 
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It remains only to distribute the remaining positive and negative row tokens 
and the remaining central column tokens to fill the central portion of the 
critical region. 

We put all of the remaining column tokens in a single pot. We know exactly 
how many row tokens remain to be placed in each row, and in each row either 
every remaining token is positive or every remaining token is negative. This 
tells us exactly how many column tokens need to be placed in each row. We 
assign that number of column tokens to each row. Three paragraphs below 
we show that the pot contains exactly the correct number of tokens to do 
this. 

We distribute the column tokens assigned to row R\ as follows. We place the 
first column token assigned to row R\ in column Ci, the second in column 
C2, the third in column C3, and so on, continuing to column C\ if necessary, 
until the column tokens assigned to R\ have been exhausted. 

Augment row and column numbers by 1 from where the last column token 
was inserted. Place the column tokens assigned to that row in successive 
columns. Iterate. When we are done, every column will contain the same 
number, 2k + 1, of column tokens. Then we distribute the remaining row 
tokens so that the total value of the tokens in each of these cells is 2. 

Now we show that the pot contains the correct number of tokens. Suppose 
that all of the tokens are distributed, possibly leaving some of these cells 
without the required two tokens. After distributing all of the tokens, the 
heat loss along every row and column is 2on + \ units. There are Pi+i/2 
rows and columns, so the total number of units is aj+i/^+i. Since there 
are CKj+i cells, the average number of units per cell is It follows that 

every cell has received the needed number of tokens. Similarly, if every cell 
has received the needed number of tokens, then the average heat loss along 
every row and column is 2aj + i units. This means that no tokens remain. 
The construction is then complete. 

It remains only to note that the column distributions are partitions (no 
negative values for E). Although the row sums oscillate because of the 
negative tokens, again the norm stays in the desired bounds. Hence the 
associated cooling function has the correct norm. 

Since we have constructed the required second difference E, we conclude 
that it is possible to construct a cooling function for B(n, k + 1, i + 1) that 
has norm FR(B(n, k + 1, i + 1), and the proof is complete. □ 
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